A Simulation Study to Evaluate the Robustness of Recent Methods for Preparing Variance Estimates in the Presence of Hot Deck Imputation

نویسندگان

  • Michael Sinclair
  • Nuria Diaz-Tena
  • Alexander Park
چکیده

Many large-scale surveys currently use a variety of single imputation methods–as discussed by Chapman (1976), Cox (1980; and Kalton and Kasprzyk (1986)—to handle item nonresponse. Since the use of such imputation increases the underlying variation in the survey results, methods are needed to assess the impact. Until fairly recently, methods to assess the impact of the imputation on the variance have not been available. Rao and Shao (1992) and Shao (2002), presented a method to measure the variance of an estimate due to the combined effect of the sample design and the use of imputation to compensate for item nonresponse. This method discussed in section II, is based on the use of a replication method of variance estimation combined with specific adjustments to the imputed values. Our research sought to explore the use of this method on the expenditure data collected in the Medical Expenditure Panel Survey (MEPS), sponsored by the Agency for Healthcare Research and Quality (AHRQ). Since the MEPS utilizes a single hot deckimputation method, and the sample design and the data files were structured to facilitate the use of a replicate variance estimation method, the survey met the basic requirements to apply Shao’s method. This paper presents results from a simulation study conducted to evaluate the feasibility of applying Shao’s procedure to MEPS data. We will discuss, that the MEPS imputation procedures do not meet all of the methodological assumptions given by Shao. In particular, Shao’s method assumes the covariates used in the imputation process are fully reported and that variance estimates are needed only for univariate statistics. The goal of our simulations was to quantify the biases in estimates of variance when these assumptions were violated.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

تحلیل مشاهدات گمشده در مطالعه اثر دوزهای مختلف مکمل ویتامین D بر مقاومت به انسولین در دوران بارداری

Introduction: The aim  of  this  study  was to impute missing data  and  to compare the effect  of  different doses of  vitamin D supplementation on  insulin resistance during  pregnancy. Methods: A clinical trial  study   was done on 104  women  with diabetes and gestational age less than 12 weeks between 1391 and...

متن کامل

An Empirical Comparison of Performance of the Unified Approach to Linearization of Variance Estimation after Imputation with Some Other Methods

Imputation is one of the most common methods to reduce item non_response effects. Imputation results in a complete data set, and then it is possible to use naϊve estimators. After using most of common imputation methods, mean and total (imputation estimators) are still unbiased. However their variances (imputation variances) are underestimated by naϊve variance estimators. Sampling mechanism an...

متن کامل

The analysis of missing data in public use survey databases : a survey of statistical methods

THE ANALYSIS OF MISSING DATA IN PUBLIC USE SURVEY DATABASES: A SURVEY OF STATISTICAL METHODS Ping Xu November 20, 2004 Missing data is very common in survey research. However, currently few guidelines exist with regard to the diagnosis and remedy to missing data in survey research. The goal of the thesis was to investigate properties and effects of three selected missing data handling technique...

متن کامل

Imputation methods for quantile estimation under missing at random

Imputation is frequently used to handle missing data for which multiple imputation is a popular technique. We propose a fractional hot deck imputation which produces a valid variance estimator for quantiles. In the proposed method, the imputed values are chosen from the set of respondents and are assigned with proper fractional weights that use a density function for the working model. In addit...

متن کامل

Hot Deck imputation for multivariate missing data

Fractional hot deck imputation, considered in Fuller and Kim (2005), is extended to multivariate missing data. The joint distribution of the study items is nonparametrically estimated using a discrete approximation, where the discrete transformation also serves to define imputation cells. The procedure first estimates the probabilities for the cells and then imputes real observations for missin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002